AITopics | multiplication problem

Collaborating Authors

multiplication problem

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Chain-of-Thought Tokens are Computer Program Variables

Zhu, Fangwei, Wang, Peiyi, Sui, Zhifang

arXiv.org Artificial IntelligenceMay-9-2025

Chain-of-thoughts (CoT) requires large language models (LLMs) to generate intermediate steps before reaching the final answer, and has been proven effective to help LLMs solve complex reasoning tasks. However, the inner mechanism of CoT still remains largely unclear. In this paper, we empirically study the role of CoT tokens in LLMs on two compositional tasks: multi-digit multiplication and dynamic programming. While CoT is essential for solving these problems, we find that preserving only tokens that store intermediate results would achieve comparable performance. Furthermore, we observe that storing intermediate results in an alternative latent form will not affect model performance. We also randomly intervene some values in CoT, and notice that subsequent CoT tokens and the final answer would change correspondingly. These findings suggest that CoT tokens may function like variables in computer programs but with potential drawbacks like unintended shortcuts and computational complexity limits between tokens. The code and data are available at https://github.com/solitaryzero/CoTs_are_Variables.

artificial intelligence, large language model, natural language, (17 more...)

arXiv.org Artificial Intelligence

2505.04955

Genre: Research Report > New Finding (0.66)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Language Models Do Hard Arithmetic Tasks Easily and Hardly Do Easy Arithmetic Tasks

Gambardella, Andrew, Iwasawa, Yusuke, Matsuo, Yutaka

arXiv.org Artificial IntelligenceJun-4-2024

The ability (and inability) of large language models (LLMs) to perform arithmetic tasks has been the subject of much theoretical and practical debate. We show that LLMs are frequently able to correctly and confidently predict the first digit of n-digit by m-digit multiplication tasks without using chain of thought reasoning, despite these tasks require compounding operations to solve. Simultaneously, LLMs in practice often fail to correctly or confidently predict the last digit of an n-digit by m-digit multiplication, a task equivalent to 1-digit by 1-digit multiplication which can be easily learned or memorized. We show that the latter task can be solved more robustly when the LLM is conditioned on all of the correct higher-order digits, which on average increases the confidence of the correct last digit on 5-digit by 5-digit multiplication tasks using Llama 2-13B by over 230% (0.13 to 0.43) and Mistral-7B by 150% (0.22 to 0.55).

digit, language model, llm, (12 more...)

arXiv.org Artificial Intelligence

2406.02356

Country: Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.97)

Add feedback

Solving the multiplication problem of a large language model system using a graph-based method

Tuncer, Turker, Dogan, Sengul, Baygin, Mehmet, Barua, Prabal Datta, Hafeez-Baig, Abdul, Tan, Ru-San, Chakraborty, Subrata, Acharya, U. Rajendra

arXiv.org Artificial IntelligenceOct-18-2023

The generative pre-trained transformer (GPT)-based chatbot software ChatGPT possesses excellent natural language processing capabilities but is inadequate for solving arithmetic problems, especially multiplication. Its GPT structure uses a computational graph for multiplication, which has limited accuracy beyond simple multiplication operations. We developed a graph-based multiplication algorithm that emulated human-like numerical operations by incorporating a 10k operator, where k represents the maximum power to base 10 of the larger of two input numbers. Our proposed algorithm attained 100% accuracy for 1,000,000 large number multiplication tasks, effectively solving the multiplication challenge of GPT-based and other large language models. Our work highlights the importance of blending simple human insights into the design of artificial intelligence algorithms. Keywords: Graph-based multiplication; ChatGPT; Multiplication problem

graph-based method, language model system, multiplication problem

arXiv.org Artificial Intelligence

2310.13016

Genre: Research Report (0.40)

Industry: Education > Curriculum > Subject-Specific Education (0.60)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.44)

Add feedback

Did GoogleAI Just Snooker One of Silicon Valley's Sharpest Minds?

#artificialintelligenceSep-15-2022, 03:10:36 GMT

In 1904, the horse du jour was Clever Hans, widely reputed to be so much smarter than his brethren that he could do math, tell time, and even read and spell. Word spread fast by word of mouth, and eventually the occasionally gullible The New York Times reported that Hans was so smart that he "can do almost everything but talk". Ask Hans what 12 plus 13 is, and he would stamp his feet 25 times. People were amazed, and paid good money to see him. Turns out the horse knew no math; it had solved the arithmetic problems--all of them --in a different way.

artificial intelligence, compositionality, machine learning, (19 more...)

#artificialintelligence

Country: North America > United States > California (0.41)

Industry:

Education (0.69)
Information Technology (0.50)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback